A Meta-Analysis Based Method for Prioritizing Candidate Genes Involved in a Pre-specific Function

Authors

  • Jingjing Zhai
  • Yunjia Tang
  • Hao Yuan
  • Longteng Wang
  • Haoli Shang
  • Chuang Ma
Abstract

The identification of genes associated with a given biological function in plants remains a challenge, although network-based gene prioritization algorithms have been developed for Arabidopsis thaliana and many non-model plant species. These network-based algorithms have encountered several problems, in particular unsatisfactory prediction accuracy due to limited network coverage, varying link quality, and/or uncertain network connectivity. A model that integrates complementary biological data may therefore be expected to increase the prediction accuracy of gene prioritization. Toward this goal, we developed a novel gene prioritization method named RafSee, which ranks candidate genes using a random forest algorithm that integrates sequence, evolutionary, and epigenetic features of plants. Subsequently, we proposed an integrative approach named RAP (Rank Aggregation-based data fusion for gene Prioritization), in which an order statistics-based meta-analysis is used to aggregate the ranks produced by the network-based gene prioritization method and by RafSee, for accurately prioritizing candidate genes involved in a pre-specific biological function. Finally, we showcased the utility of RAP by prioritizing 380 flowering-time genes in Arabidopsis. The "leave-one-out" cross-validation experiment showed that RafSee could work as a complement to a current state-of-the-art network-based gene prioritization system (AraNet v2). Moreover, RAP ranked 53.68% (204/380) of the flowering-time genes higher than AraNet v2 did, resulting in a 39.46% improvement in terms of the first quartile rank. Further evaluations also showed that RAP was effective in prioritizing genes related to different abiotic stresses. To enhance the usability of RAP for Arabidopsis and non-model plant species, an R package implementing the method is freely available at http://bioinfo.nwafu.edu.cn/software.
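The abstract does not give the exact aggregation formula used by RAP, so the following is only a minimal sketch of one common order statistics-based rank aggregation (the Q statistic computed with a recursive formula), written in Python rather than the authors' R package. The function names `q_statistic` and `aggregate`, the toy gene labels, and the toy rankings are all hypothetical and are not taken from the paper.

```python
from math import factorial


def q_statistic(rank_ratios):
    """Order-statistics Q value for one gene (assumed formula, not confirmed by the source).

    rank_ratios: rank / total-number-of-genes from each method, each in (0, 1].
    Q is the probability that uniformly random rank ratios would all be
    at least as small as the observed, sorted ones. Smaller Q means the gene
    is ranked consistently near the top.
    """
    r = sorted(rank_ratios)
    n = len(r)
    v = [1.0]  # V_0 = 1
    for k in range(1, n + 1):
        rk = r[n - k]  # r_{N-k+1} in 1-based notation
        v_k = sum(
            (-1) ** (i - 1) * v[k - i] / factorial(i) * rk ** i
            for i in range(1, k + 1)
        )
        v.append(v_k)
    return factorial(n) * v[n]


def aggregate(rankings):
    """Combine several rankings (dicts of gene -> rank, 1 = best) into one order."""
    genes = set(rankings[0])
    n = len(genes)
    scores = {g: q_statistic([rk[g] / n for rk in rankings]) for g in genes}
    # Sort by Q ascending; the gene name breaks ties deterministically.
    return sorted(scores, key=lambda g: (scores[g], g))


# Toy example: one network-based ranking and one feature-based ranking.
network_ranks = {"A": 1, "B": 2, "C": 3, "D": 4}
feature_ranks = {"A": 3, "B": 1, "C": 2, "D": 4}
combined = aggregate([network_ranks, feature_ranks])
print(combined)
```

With two methods, the Q value reduces to 2·r1·r2 − r1², where r1 ≤ r2 are the sorted rank ratios, so a gene placed near the top by both methods gets a much smaller score than one placed near the top by only one of them.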


Similar resources

Identification and evaluation of HvPIP1; 4 and HvnsLTP genes expression for drought tolerance in barley

It is of great significance to understand the tolerance mechanisms by which plants deal with drought stress and to apply these mechanisms to improve genotypes in response to drought stress. In order to identify and investigate the expression of genes involved in tolerance to drought stress, leaf and root ESTs were analyzed in Spontaneum (wild barley) and Nimruz (tolerant to drought ...


Identification of the drought tolerance involved candidate genes in foxtail millet through an integrated meta-analysis approach

Drought stress is one of the most important factors limiting production in the agricultural sector. Given the need for smart agriculture adapted to climate change, the use of drought-tolerant alternative plants with high water use efficiency is of great importance. Foxtail millet (Setaria italica L.) is one of the important drought-tolerant fodder and food grains in semi-arid regions. In th...


Investigation of genes shared between breast cancer and obesity using candidate gene prioritization

Background: Cancer and obesity are two major public health concerns. More than 12 million cases of cancer are reported annually. Many reports have confirmed obesity as a risk factor for cancer. The molecular relationship between obesity and breast cancer is not yet clear. The purpose of this study was to investigate the priorities of effective genes in the molecular relationship between obesity an...


I-3: Tale of The Tail: Candidate Genes Involved in Sperm Flagella Formation

Background: ISTS, a defect in which the sperm tail is short and the fibrous sheath and axoneme are disorganized, is one of the syndromes that cause male infertility. Although a few studies have been done in this regard, its exact etiology in humans remains unclear. The four candidate genes causing ISTS are SPEF2, RABL2B, and the A-kinase anchoring protein genes (AKAP3 and AKAP4). Proteins coded by SPEF2 and RA...


A Model for Prioritizing the Risks Associated with Road Construction Projects Based on Generalized Secondary Goal

This paper aims to provide a new model based on Data Envelopment Analysis (DEA) to prioritize project risks. The large amounts of capital involved, the long implementation periods of infrastructure projects, and the project management problems in on-time completion of projects indicate the necessity of paying particular attention to this issue and conducting applied researc...



Volume 7

Publication date: 2016